Picture for Yifan Zhu

Yifan Zhu

Stanford University Department of Electrical Engineering

OmniRAG-Agent: Agentic Omnimodal Reasoning for Low-Resource Long Audio-Video Question Answering

Add code
Feb 03, 2026
Viaarxiv icon

Continual GUI Agents

Add code
Jan 29, 2026
Viaarxiv icon

Token-Guard: Towards Token-Level Hallucination Control via Self-Checking Decoding

Add code
Jan 29, 2026
Viaarxiv icon

Sawtooth Wavefront Reordering: Enhanced CuTile FlashAttention on NVIDIA GB10

Add code
Jan 26, 2026
Viaarxiv icon

Vision Language Models for Optimization-Driven Intent Processing in Autonomous Networks

Add code
Jan 19, 2026
Viaarxiv icon

Prior Diffusiveness and Regret in the Linear-Gaussian Bandit

Add code
Jan 05, 2026
Viaarxiv icon

CreBench: Human-Aligned Creativity Evaluation from Idea to Process to Product

Add code
Nov 17, 2025
Viaarxiv icon

More Than Irrational: Modeling Belief-Biased Agents

Add code
Nov 15, 2025
Viaarxiv icon

TCPO: Thought-Centric Preference Optimization for Effective Embodied Decision-making

Add code
Sep 10, 2025
Figure 1 for TCPO: Thought-Centric Preference Optimization for Effective Embodied Decision-making
Figure 2 for TCPO: Thought-Centric Preference Optimization for Effective Embodied Decision-making
Figure 3 for TCPO: Thought-Centric Preference Optimization for Effective Embodied Decision-making
Figure 4 for TCPO: Thought-Centric Preference Optimization for Effective Embodied Decision-making
Viaarxiv icon

A Foundation Model for Chest X-ray Interpretation with Grounded Reasoning via Online Reinforcement Learning

Add code
Sep 04, 2025
Viaarxiv icon